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® A method of producing a polypeptide product end a plaamldle axpresslon vahlda therefor, a method of creating an 
expression ptasmld, a method of cleaving double atranded DNA, and ipedflc plasmlds. 

@ Novel plasmldic expression vehicles and methods of 

using them In the production of useful polypeptides t>y 

recombinant bacteria are described. The plasmlds employ a 

tryptopftan promoter*operator system from which the 

Attenuator region ordinarily present has been deleted. Bae- 

taria eomahitno.the plasmlds can accordingly be repressed 

by the addition of tryptophan against expression of desired 

polypepUdes coded for by inserted genes whBe they are 

grown to leweb suitable for industrial-scate production. 
01 M^the tryptophan may then be withdrawn, essentially 
^ derepressing the pathway and permitting efflcient produc- 

tion of the desired product In high yield. 

(0 

r* 

r^ 

8 

O 



0. 
Ul 




Acraauu AA 



4) 



0036776 



A METHOD OF PRODOCING A" POiaPEPTIDE BRODOCT 
AND A PLASMIDIC EXPRESSICN VEHia£ THEREFOR, 
A MEIHOD CF CRERIIN5 AN EXPBESSION PIASMID, 
A Mb'lMUD OF CLEAVING DOCBtJS STOMCED TX9i, 
AND SPECIFIC PIAS^aDS. 



BACKGROUND OF THE INVENTION 

• With the advent of recombinant DNA technology, the controlled 
bacterial production of an enormous variety of useful polypeptides has. 
become possible. Already in hand are bacteria modified by this 
technology to permit the production of such polypepti<ie products such as 
somatostatin (K. Itakura, et ai.. Science 198, 1056 [1977]). the 
(component) A and B chains of human insulin (D.V. Goeddel, et al., Proc 
Nat'l Acad Sci. USA 76. 106 [1979]), and human growth hormone (O.V. 
Goeddel, et al.. nature 28|,. 544 [1979]). More recently, recombinant 
DNA techniques have been used to occasion the bacterial production of 
thymosin alpha 1. an immune potentiating substance produced by the 
thymus. Such is the power of the technology that virtually 
any useful polypeptide can be bacterially produced, putting 
within reach the controlled manufacture of hormones, 
enzymes, antibodies, and vaccines against a wide variety 
of diseases. The cited materials, which describe 
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in greater detail the representative examples referred to above, are 
incorporated herein by reference, as are other publications referred to 
^nfra, to illuminate the background of the Invention. 

The work horse of recombinant DMA technology is the plasmid, 
5 . non-chromosomal loop of double-stranded ONA found in bacteria, 
oftentimes in multiple copies per bacterial cell. * Included in the 
in€onnation encoded In the .plasmid DNA is that required to reproduce the 
plasmid in daughter cells (i.e.,* a "replicon") and ordinarily, one. or 
more selection characteristics, such as resistance to antibiotits, which 
10 permit clones of the host cell containing the plasmid of interest to be 
recognized and preferentially grown in selective media. The utility of 
bacterial plasmids lies in the fact that they can be specifically 
cleaved by one or another restriction endonuclease or "restriction 
enzyme", each of which recognizes a different site on the plasmidic 
15 DNA. Thereafter heterologous genes or gene fragments may.be inserted 
into the plasmid by endwise joining at the cleavage site or at 
reconstructed ends adjacent 'the cleavage site. As used herein, the term- 
"heterologous" refers to a gene not 'ordinarily found in, or a 
polypeptide sequence ordinarily not produced by, Z, coli . whereas the 
20 term "homologous" refers to a gene or polypeptide which is produced in 
wild-type £. coll . ONA recombination is performed outside the bacteria, 
but the resulting "recombinant"' plasmid can be introduced into bacteria 
by a process known as transformation and large quantities of the 
heterologous gene-containing recombinant plasmid obtained by growing the 
transfonhant. Moreover, where the gene is properly inserted with 
reference to portions of the plasmid which govern the transcription and 
translation of the encoded ONA message, the resulting expression vehicle 
can be used to actually produce the polypeptide sequence for which the 
inserted gene codes, a process refemd to as expression. 

Expression is initiated in a region known as the promoter which is 
recognized by and bound by RNA polymerase. In some cases, as in the trp 
operon discussed infra , promoter regions are overlapped by "operator" 
regions to form a combined promoter-operator. Operators are DNA 
sequences which are recognized by so-called repressor proteins which 
serve to regulate the frequency of transcription initiation at a 
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particular promoter. The polymerase travels along the ONA, transcribing 
the information contained in the coding strand from its 5* to 3* end 
. into messenger RNA which is in turn translated into a polypeptide having 
the amino acid sequence for which the ONA codes. Each amino acid is 

5 • encoded by a unique nucleotide triplet or "codon" within what •*« for 
present purposes be referred to as the "structural gene", i.e. that part 
iJhich encodes the amino acid sequence of the expressed product. After 
binding to the promoter, the RNA polymerase first transcribes 
nucleotides encoding a ribosome binding site, then a translation 

10 initiation or "start" signal (ordinarily ATG, which in the resulting 
messenger RNA becomes AUG), then the nucleotide codons within the 
structural gene itself. So-called stop codons are transcribed at the 
end of the structural gene whereafter the polymerase may form an 
additional sequence of messenger RNA which, because of the presence of 

15 the stop signal, will remain untranslated by the ribosomes. Ribosomes 
bind to the binding site provided on the messenger RNA, in bacteria 
ordinarily as the mRNA is being formed, and themselves produce the 
encoded polypeptide, beginning at the translation start signal and 
ending at the previously mentioned stop signal. The desired product is 

20 produced if the sequences encoding the ribosome binding site are 

positioned properly with respect to the AUG initiator codon and if all 
remaining codons follow the initiator codon In phase. The resulting 
product may be obtained by lysing the host cell and recovering the 
product by appropriate purification from other bacterial protein. 

25 Polypeptides expressed through the use of recombinant ONA 

technology may be entirely heterologous, as in the case of the direct 
expression of human growth hormone, or alternatively may comprise a 
heterologous polypeptide and, fused thereto, at least a portion of the 
amino acid sequence of a homologous peptide, as in the case of the 

30 production of intermediates for somatostatin and the components of human 
insulin. In the latter cases, for example, the fused homologous 
polypeptide comprised a port ion of the amino acid sequence for beta 
galactosidase. In those cases, the intended bioactive product is 
bioinactivated by the fused, homologous polypeptide until the latter Is 

35 cleaved away in an extracellular environment. Fusion proteins like 
those Just mentioned can be designed so as to permit highly specific 
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cleavage of the precusor protein from the Intended product, as by the 
action of cyanogen bromide on methionine, or alternatively by enzymatic 
cleavage. See, eg., G.B. Patent Publication No. 2 007 676 A. 

If recombinant ONA technology Is to fully sustain Its promise, 
5 systems must be devised which optimize expression of gene inserts, so 
that the intended polypeptide products can be made available in high 
yield. The beta lactamase and lactose promoter-operator systems most 
commonly used in the past, while useful, have not fully utilized the 
capacity of the technology from the standpoint of yield. A need has 
10 -existed for a. bacterial expression vehicle capable of the controlled 
expression of desired polypeptide products In higher yield. 

Tryptophan is an amino acid produced by bacteria for use as a 
component part of homologous polypeptides in a blosynthetic pathway 
which proceeds: chorlsmic acid-* anthrani lie acid-^phosphoribosyl 

15 an thranilac acid CORP Cenol-l-(o-carboxyphenylamino)-l-desoxy-D- 
ribulose-S-phosphate]-*- indol-3-glycerol-phosphate, and ultimately to 
tryptophan Itself. The enzymatic reactions of this pathway are 
catalyzed by the products of the tryptophan or "trp" operon, a 
polycistronic ONA segment which Is transcribed under the direction of 

20 the trp promoter-operator system. The resulting polycistronic messenger 
RNA encodes the so-called trp leader sequence and then, in order, the 
polypeptides referred to as trp E, trp Q, trp C, trp B and trp A. These 
polypeptides variously catalyze and control Individual steps in the 
pathway chorlsmic acid tryptophan. 

25 In wild-type E. coll, the tryptophan operon is under at least three 

distinct forms of control. In 'the case of promoter-operator repression, 
tryptophan .acts as a corepressor and binds to its aporepressor to form 
an active repressor complex which, in turn, binds to the operator, 
closing down the pathway in its entirety. Secondly, by a process of 

30 feedback inhibition, tryptophan binds to a complex of the trp E and trp 
0 polypeptides, prohibiting their participation in the pathway 
synthesis. Finally, control is effecteu by a process known as 
attenuation under the control of the "attenuator region" of the gene, a 
region within the trp leader sequence. See generally 6.F. Mlozzari 
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et ai, J. Bacterlolooy 133. 1457 (1978); The Operon 263-302. Cold Spring 
Harbor Laboratory (1978), Miller and Reznikoff, eds.; F. Lee et ai, 
Proc. Natl. Acad. Sci. USA 74. 4365 (1977) and K. Bertrand et al, J. 
Mol. Biol. 103. 319 (1975). The extent of attenuation appears to be 
governed by the intracellular concentration of tryptophan, and In 
wild-type E. coli the attenuator terminates expression in approximately 
nine out of ten cases, possibly through the formation of a secondary 
structure, or "terminatfon loop', in the messenger RNA which causes the 
RNA polyrtferase to prematurely disengage from the associated DNA. 



Other workers have employed the trp operon to ol?tain some measure 
of heterologous polypeptide expression. This work, it is believed, 
attempted to deal with problems of repression and attenuation by the 
addition of-indole acrylic acid, an inducer and analog which competes 
with tryptophan for trp repressor molecules, tending toward derepression 
15 by competitive inhibition. At the same time the inducer diminishes 
attenuation by inhibiting the enzymatic conversion of indole to 
tryptophan and thus effectively depriving the cell of tryptophan. As a 
result more polymerases successfully read through the attentuator. 
However, this approach appears problematic from the standpoint of 
completing translation consistently and in high yield, since 
tryptophan-containing protein sequences are prematurely terminated in 
synthesis due to lack of utilizable tryptophan. Indeed, an effective 
relief of attenuation by this approach is entirely dependent on severe 
tryptophan starvation. 

25. The present invention addresses problems associated with tryptophan 
repression and attenuation in a different manner and provides (1) a 
method for obtaining an expression vehicle designed for direct 
expression of heterologous genes from the trp promoter-operator, (2) 
methods for obtaining vehicles designed for expression, from the 

30 tryptophan operator-promoter, of specifically cleavable polypeptides 
coded by homologous-heterologous gene fusions and (3) a method of 
expressing heterologous polypeptides controllably. efficiently and in 
high yield, as well as the associated means. 
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SUMMARY OF THE INVENTION 



According to the present invention, novel plasnidic expression 
•vehicles are provided for the production in bacteria of heterologous 
polypeptide products, the vehicles having a sequence of double-stranded 

$ DNA comprising, in phase from a first 5' to a second 3' end of the 
coding strand, a trp pronoter-operator, nucleotides coding for tfie trp 
leader ribosoine binding site, and nucleotides encoding translation 
initiation for expression of a structural gene that encodes the amino 
acid sequence of the heterologous polypeptide. The DMA sequence referred 

10 to* contains neither a trp attenuator region nor nucleotides coding for 
the trp E ribosome binding site. Instead, the trp leader ribosome 
binding site is efficiently used to effect expression of the information 
encoded by an inserted gene. 

Cells are transformed by addition of the trp promoter-operator-* 

15 containing and attenuator-lacking plasraids of the Invention and grown up 
In the presence of additive tryptophan. The use of tryptophan-rich 
media provides sufficient tryptophan to essentially completely repress 
the trp promoter-operator through trp/repressor Interactions, so that 
cell growth can proceed uninhibited by premature expression of large 

20 quantities of heterologous polypeptide encoded by an Insert otherwise 
under the control of the trp promoter-operator system. When the 
recombinant culture has been grown to the levels appropriate for 
industrial production of the polypeptide, on the other hand, the 
external source of tryptophan is removed, leaving the cell to rely only 

25 on the tryptophan that it can itself produce. The result is mild 

tryptophan lioitation and, accordingly, the pathway is derepressed and 
highly efficient expression of the heterologous insert occurs, 
unhampered by attenuation because the attenuator region has been deleted 
from the system. In this manner the cells are never severely deprived 

30 of tryptophan and all proteins, whether they contain try^tbphan or not, 
can be produced In substantial yields. 

The Invention further provides means of cleaving double-stranded 
DNA at any desired point, even absent a restriction enzyme site, a 
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technique useful In, mong other things, the creatlon'of trp operons 
having attenuator deletions other than those previously obtained by 
selection of mutants. 

Finally, the Invention provides a variety of useful Intermediates 
5 and endproducts. Including specifically cleavable heterologous- 

homologous fusion proteins that are stabilized against degradation under 
expression conditions. 

The manner In which these and other objects and advantages of the 
Invention are obtained will become more apparent from the detailed 
10 description which follows and from the accompanying drawings In which: 

Figures 1 and 2 Illustrate a preferred scheme for forming plasmids 
capable of expressing heterologous genes as fusions with a 
portion of the trp D polypeptide, from which fusion they may 
be later cleaved; 

15 Figure 3 is the result of polyacryl amide gel segregation of cell 

protein containing homologous (trp 0') - heterologous 
(somatostatin or thymosin a I) fusion proteins; 
Figures 4, 5 and 6 illustrate successive stages in a preferred 
scheme for the creation of a plasmid capable of directly 

20 expressing a heterologous gene (human growth hormone) under 

the control of the trp promoter-operator system; 
Figure 7 is the result of polyacryl amide gel segregation of cell 
protein containing human growth hormone directly expressed 
under the control of the trp promoter-operator system; 

25 Figures 8,9 (a-b) and 10 Illustrate in successive stages a 

preferred scheme for the creation of plasmids capable of 
expressing heterologous genes (In the Illustrated case, for 
somatostatin) as fusions with a portion of the trp C ' 
polypeptide, from which fusions they may be later cleaved;* 

30 Figure U is the result of polyacryl amide gel segregation of cell 

protean containing homologous (trp E) - heterologous fusion 
proteins for the production of, respectively, somatostatin, 
thymosin alpha 1, human prolnsulln, and the A and B chains of 
human Insulin. 
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Figures 12 and 13 illustrate in successive stages the manner in 
which the plasmid created by the scheme of Figures 8-10 
inclusive is manipulated to form a system in which other 
heterologous genes. may be interchangeably expressed- as fusions 
with trp E polypeptide sequences. 



In the Figures, only the coding strand of the double-stranded 
plasmid and linear ONAs are depicted In most instances, for clarity in 
illustration. Antibiotic resistance-encoding genes are denoted ap 

a e 

(ampicillin) and tc (tetracycline). 'The legend tc connotes a gene 
10 for tetracycline resistance that Is not under the control of a 

promoter-operator system, such that plasmids containing the gene will 

c 

•nevertheless be tetracycline sensitive. The legend "ap " connotes 
ampicillin sensitivity resulting from deletion of a portion of the gene 
encoding ampicillin sensitivity. Plasmidic promoters and operators are 
15 denoted "p" and "o". The- letters A, T, 6 and C respectively connote the 
nucleotides containing the bases adenine, thymine, guanine and 
cytosine. Other Figure legends appear from the text. 



The preferred embodiments of the Invention described below involved 
•use of a number of commonly available restriction endonucleases next 
20 identified, with their corresponding. recognition sequences and 
(indicated by arrow) cleavage patterns. * 



25 



30 



Xbal: 



EcoRI; 



Bglll; 



PvuII 



BamHI: 



1^ 



CTA6A 
AGATCjT 

gLttc 

CTTAA^ 



GATCT 
TCTAGA 

i 

GAGCTG 
6TCGAC 

6GATCC 

CCTAGG 
t 



TaqI; 



Hindlll: 



Hpal: 



PstI: 



CGA 
AGty 

MGCn 
nCGAA 

t 

GHAAC 
CAA^HG 

CTGci^G 

GAC6TC 
t 
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Where the points of cleavage are spaced apart on the respective strands 
the cleaved ends will be -sticky-, ie. capable of reannealing or of 
annealing to other cotnplententarily -sticky--ended DNA by Watson-Crick 
base pairing (A to T and 6 to C)* in mortise and tenon fashion. Some 
5 restriction enzymes, such as Hpal and PvuII above, cleave to leave 
-blunf ends. The nucleotide sequences above are represented in 
■ accordance with the convention used'^hroughout: upper strand is the 
protein encoding strand, and in proceeding from left to right oo that 
strand one moves from the 5* to the 3' end thereof, ie, in the direction 
10 of transcription from a -proximal- toward a "distal" point. 

Finally with regard to conventions, the symbol "a- connotes a 
deletion. Thus, for example, reference to a plasmid followed by. say. 
■aEcoRI-Xbal- describes the plasmid from which the nucleotide sequence 
between EcoRI and Xbal restriction enzyme sites has been removed by 
15 digestion with those enzymes. For convenience, certain deletions are 
denoted by number. Thus, beginning from the first base pair ("bp-) of 
the EcoRI recognition site which precedes the gene for tetracycline 
resistance in the parental plasmid pBR322.-Al- connotes deletion of 
bpl-30 (ie. AEcoRI-Hind III) and consequent disenabling of the 
20 tetracycline promoter-operator system; -a2- connotes deletion of bp 1-375 
(ie. aEcoRI-BamHI) and consequent removal of both the tetracycline 
promoter-operator and the structural gene which encodes tetracycline 
resistance; and -a3" connotes deletion of bp 3611-4359 (ie. APstl-EcoRI) 
and elimination of ampicillin resistance. -a4- is used to connote 
25 removal of bp -900 --1500 from the trp operon fragment 5 (Fig. I), 
eliminating the structural gene for the trp D polypeptide. 



DETAILED DESCRIPTION 



30 



The trp leader sequence is made up of base pairs ("bp") 1-162. 
starting from the start point for trp mRNA. A fourteen amino acid* 
putative trp leader polypeptide is encoded by bp 27-71 following the ATG 
nucleotides which encode the translation start signal. The trp 
attenuator region comprises successive GC-rich and AT-rich sequences 
lying between bp 114 and 156 and attenuation is apparently effected on 
mRNA nucleotides encoded by bp -134-141 of the leader sequence. To 
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express a heterologous polypeptide under the direction of the trp leader 
ribosome binding site and at the same time avert attenuation, the 
following criteria must be observed: 

1. Base pairs 134-141 or beyond oust be deleted; 
5 2. The ATG codon of the Inserted gene oust be positioned in 

correct relation to a ribosome binding site, as is known (see, 
eg.^J.A, Steitz "Genetic signals and nucleotide sequences in 
mess'enger RNA" in Biological Regulation and Control (ed. R. 
Goldberger) Plenum Press, N.Y. (1978). 
Id 3. Where a homologous-heterologous fusion protein is to be 
produced, the translation start signal of a homologous 
polypeptide sequence should remain available, and the codons 
for the homologous portion of the fusion protein have to be 
inserted in phase without intervening translation stop signals. 
15 For example, deleting all base pairs within the leader sequence 
distal from. bp. 70 removes the attenuator region, leaves the ATG 
sequence which encodes the translation start signal, and eliminates the 
intervening translation stop encoded by TCA (bp. 69-71), by eliminating 
A and following nucleotides. Such a deletion would result in expression 
20 of a fusion protein beginning with the leader polypeptide, ending with 
that encoded by any heterologous insert, and including a distal region 
of one of the post-leader trp operon polypeptides determined by the 
extent of the deletion in the 3' direction. Thus a deletion extending 
into the-.E gene would lead to expression of a homologous precursor 
25 comprising the L sequence and the distal region of E (beyond the 
deletion endpoint) fused to the sequence encoded by any following 
insert, and so on. 

Two particularly useful plasmids from which the attenuator region 
has been deleted are the plasmids pGMl and pGM3, 6.F. Miozzari et al. 

30 J. Bacteriology 133. 1457 (1978). These respectively carry the 

deletions trp aLE 1413 and trp aLE 1417 and express (under the control 
of the trp promoter-operator) a polypeptide comprising approximately the 
first six amino acids of the trip leader and distal regions of the E 
polypeptide. In the most preferred case, pGMl, only about the last 

35 third of the E polypeptide is expressed whereas p6H3 expresses almost 
the distal one half of the E polypeptide codons. E. coll. K-12 strain 
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W3110 tna 2"trp"al02 containing pGMl has been deposited with the 
American Type Culture Collection (ATCC no. 31622). pGMl may be 
conventionally removed from the strain for use In the procedures 
described below. 

Alternatively, deletions may be effected by means provided by the 
Invention for specifically cleaving double-stranded ONA at any desired 
site. One example of this cleavage technique appears from Part IV of 
the experimental section, infra . Thus, double-stranded DNA Is converted 
to single-stranded DNA In the region surrounding the Intended cleavage 
10 point, as by reaction with lambda exonuclease. A synthetic or other 
single-stranded DNA primer Is then hybridized to the single-stranded 
length earlier formed, by Watson-Crick base-pairing, the primer sequence 
being such as to ensure that the 5' end thereof will be coterminous with 
the nucleotide on the first strand Just prior to the Intended cleavage 
15 point. The primer is next extended in the 3' direction by reaction with 
DNA polymerase, recreating that portion of the original double-stranded 
DNA prior to the intended cleavage that was 'lost in the first step. 
Simultaneously or thereafter, the portion of the. first strand beyond the 
Intended cleavage point is digested away. To summarize, where "v" marks 
20 the intended cleavage point: 

*^ ]X intended cleavage point "v" 



made single stranded 
around "v" 

primer hybridization 



extension from primer 



single strand digestion 



b) 



25 c) 



d) 



e) 
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In the most preferred embodiment, steps (d) and (e) are performed 
simultaneously, using a polymerase that simultaneously digests the 
protruding single stranded end In the 3'.» 5' direction and extends the 
primer (in the presence of dATP, dGTP, dTTP and dCTP) in the 5' > 3* 

5 direction. The material preferred for this purpose Is Klenow Polymerase 
I, ie, that fragment obtained by proteolytic cleavage of DNA Polymerase 
I which contains the 5' > 3* polymerizing activity and the 3' »> 5' 
exonucleolytic activity of the parental enzyme, yet lacks its 5' » 3* 
exonucleolytic activity. A. Romberg, DNA Synthesis. 98, H.H. Freeman 

10 and Co., SFO (1974). 

Using the procedure just described, attenuator deletions may be 
made In any desired manner in a trp operon-containing plasmid- first 
linearized by, eg, cleavage at a restriction site .downstream from the 
point at which the molecule Is to be blunt-ended ("v* above). 
15 Recircularizatlon following deletion of the attenuator region may be 
effected, eg, by blunt end ligation or other manners which will be 
apparent to the art-skilled.- 

Although the Invention encompasses direct expression of 
heterologous polypeptide under the direction of the trp promoter- 

20 operator, the preferred case involves expression of fused proteins 
containing both homologous and heterologous sequences, the latter 
preferably being specifically cleavable from the former In 
extra-cellular environs. Particularly preferred are fusions In which 
the homologous portion comprises one or more amino acids of the trp. 

25 leader polypeptide and about one-third or more of the trp E amino acid 
sequence (distal end). Fusion proteins so obtained appear remarkably 
stabilized against degradation under expression conditions. 
• 

Bacteria E. coH K-12 strain WSllO tna 2'trp"Al02 (p6Ml)., ATCC 
No. 31622, may be used to amplify stocks of the pSMl plasmid preferably 
employed in constructing the attenuator-deficient trp promoter-operator 
30 systems of the invention. This strain is phenotyplcally trg, in the 
presence of anthranllate and can be grown in minimal media such as 18 
supplemented with 50 wg/ml anthranllate. 
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All bacterial strains used in trp promoter-operator directed 
expression according to. the Invention are trp repressor* ("trp R*") 
as in the case of wild-type £. coll . so as to ensure repression until 
heterologous expression is Intended. 

DNA recombination is. In the preferred embodiment, performed in 
E. coll. K-12 strain -294 (end A, thi", hsr", hsn^), ATCC No. • 
31446, a bacterial strain whose membrane characteristics facilitate 
transformations. Heterologous polypeptide-prodiicing plasmids* grown in 
strain 294 are conventionally extracted and maintained In solution (eg, 
lOmM tris, ImM EDTA,pH8) at from about -20*C to about 4*C. For 
expression under industrial conditions, on the other hand, we prefer a 
more hardy strain, ie, E. coli K-12 xT" RV 308 str'*, gal 308* ' 
ATCC No. 31608. RV 308 is nutritionally wild-type and grows well In 
minimal media, synthesizing all necessary macromolecules from 
conventional mixes of anmonlum, phosphate and magnesium salts, trace 
metals and glucose. After transformation of RV 308 culture with strain 
294-derived p'lasmid the culture is plated on media selective for a 
marker (such as antibiotic resistance) carried by the plasmid, and a 
transformant colony picked and grown In flask culture. Aliquots of the 
latter in 10% OMSO or glycerol solution (in sterile Wheaton vials) are 
shell frozen in an ethanol-dry ice bath and frozen at -80*C. To produce 
the encoded heterologous polypeptide the culture samples are grown up in 
media containing tryptophan so as to repress the trp promoter-operator 
and .the system then deprived of additive tryptophan to occasion 
expression. 

For the first stage of growth one may employ, for example, LB 
medium (J.H. Miller, Experiments In Molecular Genetics. 433. Cold Spring 
Harbor Laboratory 1972) which contains, per liter aqueous solution, lOg 
Bacto tryptone, 5g Bacto yeast extract and lOg NaCl. Preferably, the 
inoculant is grown to optical density ("o.d.") of 10 or more (at 550 
nM), more preferably to o.d. 20 or more, and most preferably to o.d. 30 
or more, albeit to less than stationary phase. 
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For derepression and expression the inoculant Is next grown under 
conditions which deprive the cell of additive triptophan. One 
appropriate media for such growth is M9 (J.H. Miner, supra at 431) 
prepared as follows (per liter): 

* • • • 

5 KHgPO^ . 3g . 

NagHPO^ 6g 
liaCl O.Sg - * 

• • • • • _ • 

Autoclave, then-add: 

* 

10 10 ml 0.01M CaCI^ 

I ml 1M MgSO^ . • ... 

10 ml 20Z glucose 
Vitamin Bl Ipg/ral 

m 

Humkq hycase amino 
15 or OIFCO cas. amino acids 40 wg/ml. 

* . • 

The amino acid supplement is a tryptophan-lacking acid hydrqlysate of 
casein. 

• 

To commence expression of the heterologous polypeptide the 
Inoculant grown In tryptophan-rlch media may, eg, be diluted Intoa- 

20 larger volume of medium containing no additive tryptophan (for example, 
2-10 fold dilution) grown up to any desired level (preferably short of 
stationary growth phase) and the Intended product conventionally 
obtained by lysis, centrifugatlon and purification. In the 
tryptophan-deprived growth stage, the cells are preferably grown to od 

25 in excess of 10, more preferably in excess of od 20 and most preferably 
to or beyond od 30 (all at 550 nM) before product recovery. 

All DNA recombination experiments described In the Experimental 
section which follows were conducted at Genentech Inc. In accordance 
with the National Institutes of Health Guidelines for Recombinant DNA 
30 research. 
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I. Expression of 0-polypeptide fusion protein 00367 7& 

A preferred method of expressing fusion proteins comprising desired 
polypeptides and, fused thereto, a portion of the amino acid sequence of 
the trp 0 polypeptide that is separable in vitro by virtue of a 
methionine amino acid specifically sensitive to CNBr cleavage, \s 
described with reference to Figures 1-3. 

A. Construction of pBRHtrp 

Plasmid p6Ml (1^, Fig. 1) carries the E. coli tryptophan operon 
containing the deletion ALE1413 (G.F. Miozlari, et al., (ig78) J/ 
Bacteriology 1457-1466)) and hence expresses a fusion protein compristng 
the first 6 amino acids of the trp leader and approximately the last . 
third of the trp E polypeptide (hereinafter referred to in conjunction 
as LE'). as well as the trp 0 polypeptide in its entirety, all under the 
control of the trp promoter-operator system. The plasmid, 20 yg, was 
15 digested with the restriction enzyme PvuII which cleaves the plasmid at 
five sites. The gene fragments 2 were next combined with EcoRI linkers 
(consisting of a self complementary oligonucleotide 3 of the sequence: 
pCATGAATTCATG) providing an EcoRI cleavage site for a later cloning into 
a plasmid containing an EcoRI site (20). The 20 wg of DNA fragments 2 
obtained from pGMl were treated with 10 units T^.DNA ligase in the 
presence of 200 pico moles of the 5'-phosphorylated synthetic 
oligonucleotide pCATGAATTCATG (3) and in 20ul T^ ONA ligase buffer 
(20mM tris. pH 7.6, 0.5 mM ATP, 10 mM MgClg. 5 mM dithiothreitol) at 
4*C overnight. The solution was then heated 10 minutes at 70'C to halt 
25 ligation. The linkers were cleaved by EcoRI digestion and the 
fragments, now with EcoRI ends were separatejd using 5 percent 
polyacryl amide gel electrophoresis (herein after "PAGE") and the three 
largest fragments isolated from the gel by first staining with ethidium 
bromide, locating the fragments with ultraviolet light, and cutting from 
the gel the portions of interest. Each gel fragment, with- 300 
microliters O.lxTBE, was placed in a dialysis bag and subjected to 
electrophoresis at 100 v for one hour In O.lxTBE buffer (T8E buffer 
contains: 10.8 gm tris base, 5.5 gm boric acid, 0.09 gm NagEDTA in 1 
liter HgO). The aqueous solution was collected from the dialysis 
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bag, phenol extracted, chloroform extracted and made 0.2 M sodium 
chloride, and the ONA recovered In water after ethanol 
precipitation. [All ONA fragment Isolations hereinafter described, 
are performed using PAGE followed by the electroelutlon method just 
5 discussed]. The trp promoter-operator-containing gene with EcoRI 
sticky ends £was Identified In the procedure next described, which 
entails the Insertion of fragments Into a tetracycline sensitive 
plasmid 6^ which, upon promoter-operator Insertion, becomes 
tetracycline resistant. 

• • • • * 

* • • - 

10 B. Creation of the plasmid pBRHtrp expressing tetracycline 
resistance under the control of the trp promoter-operator and 
Identification and amplification of the trp promoter-operator 
containing ONA fragment 5, Isolated In (A.) above. 

Plasmid pBRHl (6^), (R.I. Rodriguez, et al_.. Nucleic Acids 
15 Research 6^, 3267-3287 [1979]) expresses amplcllln resistance and 
contains the gene for tetracycline resistance but, there being no 
associated promoter, does not express that resistance. The plasmid 
is accordingly tetracycline sensitive. By introducing a 
promoter-operator system in the EcoRI site, the plasmid can be made 
20 tetracycline resistant. 

pPRHl was digested with EcoRI and the enzyme removed by phenol 
extraction followed by chloroform extraction and recovered In watet^ 
after ethanol precipitation. The resulting DNA molecule 7^ was, In 
separate reaction mixtures, combined with each of the three DNA 

25 fragments obtained In part A. above and llgated with T^ ONA llgase 
as previously described. The ONA present in the reaction mixture was 
used to transform competent £. coll K-12 strain 294, K. Bacicman et 
al., Proc Nafl Acad Scl USA 73, 4174-4198 [1976]) (ATCC no. 31448) 
by standard techniques (V. Hers hfl eld et al., Proc Nat'l Acad Sci USA 

30 71, 3455-3459 [1974]) and the bacteria plated on LB plates containing 
20 wg/ml ampicillln and 5 ug/ml tetracycline. Several 
tetracycline-reslstant colonies were selected, plasmid ONA Isolated 
and Ihe presence of the desired fragment confirmed by restriction 
enzyme analysis. The resulting plasmid 8, designated pBRHtrp, 
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expresses 6-lactamase, Imparting amplclllln resistance, and It 
contains a ONA fragment Including the trp promoter-operator and 
encoding a first protein comprising a fusion of the first six amino 
acids of the trp leader and approximately the last third of the trp E 
5 polypeptide (this polypeptide Is designated LE'), and a second 

protein corresponding to approximately the first half of the trp 0 • 
polypeptide (this polypeptide Is designated D'), and a third protein 
coded for by the tetracycline resistance gene. 

C. Cloning genes for various end-product polypeptides and expression 
10 of these as fusion proteins comprising end-product and specifically 
cleavable trp 0 polypeptide precursor (Figure 2). 

A ONA fragment comprising the trp promoter-operator and codons 
for the LE* and D' polypeptides was obtained from plasmid pBRHtrp and 
Inserted Into plasmids containing structural genes for various 
15 desired polypeptides, next^exemplifled for the case of somatostatin 
(Figure 2). 

pBRH trp was digested with EcoRI restriction enzyme and the 
resulting fragment 5^ Isolated by PAGE and electroelution. 
EcoRI-digested plasmid pSom 11 (X. Itakura et a1. Science 198, 1056 

20 (1977); G.6. patent publication no. 2 007 676 A) was combined with 
fragment 5^. The mixture was ligated with ONA ligase as 
previously described and the resulting DNA transformed Into t, coli 
K-12 strain 294 as previously described. Transfonnant bacteria were 
selected on ampicill in-containing plates. Resulting 

25 ampicill in-resistant colonies were screened by colony hybridization 
(M. Gruenstein et al., Proc Hafl Acad Sci USA 72. 3951-3965 [1975]) 
using as a probe the trp promoter-operator-containing fragment S 
Isolated from pBRKtrp. which had been radioactively labelled with 

3? 

P Several colonies shown positive by colony hybridization were 
30 selected, plasmid DNA was isolated and the orientation of the 
inserted fragments determined by restriction analysis employing 
restriction enzymes Bglll and BamHI in double digestion. E. coli 294 
containing the plasmid designated pS0M7a2, 21* which has the trp 
promoter-operator fragment in the desired orientation was grown in LB 
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medium containing 10 wg/m1 amplcillln. The cells were grown to 
optical density 1 (at 550 nM), collected by centrlfugatUn and 
resuspended In M9 nedfa In tenfold dilution. Cells were grown for 
2-3 hours, again to optical density 1, then lysed and total cellular 

5 protein analyzed by SOS (sodium dodcyl sulfate) urea (15 percent) 
polyacrylaraide gel electrophoresis (J.V. Halzel Jr. et al., Heth 

. Viral 5, 180-246 [1971]). 

Figure 3 Illustrates a protein gel analysis In which total 
protein from' various cultures is separated by size. The density of 

10 Individual bands reflects the quantity In whicK the respective * 
proteins are present. With reference to Figure 3, lanes 1 and 7 are 
controls and comprise a variety of proteins of previously determined 
size which serve as points of comparative reference. Lanes 2 and 3 
segregate cellular protein from colonies of E_. coll 294 transformed 

15 with plasmid pSom7 a2 and respectively grown In LB (lane 2) and M9 
(lane 3) media. Lanes 4 and 5 segregate cellular protein obtained 
from similar cells transformed with the plasmid pTho7 a2, a thymosin 
expression plasmid obtained by procedures essentially Identical to 
those already described, beginning with the plasmid' pThal (see the 

20 commonly assigned US patent application of Roberto Crea and Ronald B. 
Wetzel, filed February 28, 1980 for thymosin Alpha 1 Production, the 
disclosure of which is incorporated herein by reference). Lane 4 
segregates cellular protein f rom E^. col1_ 294/pTho7 a2 grown in LB 
media, whereas lane 5 segregates cell protein from the same- 

25 transformant grown in M9 media. Lane 6, another control, is the 
protein pattern of E, coll 294/pBR322 grown In LB. 

* • 

Comparison to controls shows the uppermost of the two most 
prominent bands in each of lanes 3 and 5 to be proteins of size 
'anticipated in the case of expression of a fusion protein comprising 
30 the D' polypeptide and, respectively, somatostatin and thymosin (the 
other prominent band represents the LE' polypeptide resulting from 
deletion of the attenuator). Figure 3 confirms that expression Is 
repressed in tryptophan-rlch media, but derepressed under tryptophan 
deficient conditions. 
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D. Cyanogen bromide cleavage and radioimmunoassay for hormone product 

* • • • 

For both the thymosin and somatostatin cases, total cellular 
protein was cyanogen bromide-cleaved, the cleavage product recovered 
. and, after drying, was resuspended'ln buffer and analyzed by radio- 
5 Imnunoassay, confirming the expression of product lonunologlcally 
identical, respectively, to somatostatin andrthymosln. Cyanogen 
bromide cleavage was as described In D.V. Goeddel et jH., Proc Nat'l 
.Acad Sci USA 76. 106-110 [1979]). " " 

11. Construction of plasmids for direct expression of heterologous 
10 genes under control of the trp promoter-operator system 

The strategy for direct express ion entailed creation of a 
plasm.id containing a unique restriction site distal from all control 
elements of the trp operon into which heterologous genes could be 
cloned in lieu of the trp leader sequence and in proper, spaced 
15 relation to the trp leader polypeptide's ribosome binding site. The 

• direct expression approach is next exemplified for the case of human 
growth hormone expression. 

The plasmid pSom7 a2, lOug, was cleaved w4th EcoRI and the DNA 
fragment 5^ containing the tryptophan genetic elements was Isolated by 

20 PAGE and electroelution. This fragment. 2Mg, was digested with the 
restriction endonuclease Taq I. 2 units, 10 minutes at 37*C such 
that, on the average, only one of. the approximately five Taq I sites 
in each molecule is cleaved. This partially digested mixture of 
fragments was separated by PAGE and an approximately 300 base pair 

25 fragment \Z (Fig. 4) that contained one EcoRI end and one Taq I end 
was Isolated by electroelution. The corresponding Taq I site is 
located between the transcription start and translation start iites' 
and is 5 nucleotides upstream from the ATG codon of the trp leader 
peptide. The ONA sequence about this site is shown in Figure 4. By 

30 proceeding as described, a fragment could be Isolated. containing all 

control elements of the trp operon. i.e., promoter-operator system. 

transcription Initiat.ion signal, and trp leader ribosome binding 
site. 
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The Taq I'resldue at the 3' end .of the resulting fragment 
adjacent the translation start signal for the trp leader sequence was 
next converted into an Xbal site, as shown in Figure 5. This was 
done by ligating the fragment 12 obtained above to a plasnid 
5. containing a unique (i.e., only one) EcoRI site and a unique Xbal 
site. For this purpose, one nay employ essentially any plasnid 
containing; in order, a repycon, a selectable narker such as • - 
antibiotic resistance, and EcoRI, Xbal aiid BamHI sites. Thus, for 
: example, an Xbal site, can be Introduced between the EcoRI and -BamHI 
10 sites of PBR322 (F. Bol ivar et al. , Gene 2, 95-119 [1977]) by, e.g., 
cleaving at the plasmid's unique Hind III site with Hind III followed 
by single strand-specific nuclease digestion of the resulting sticky 
ends, and bjunt end ligation of a self annealing double-stranded 
synthetic nucleotide containing the recognition site such as 
15 CCTCTAGAGG. Alternatively, naturally derived DNA fragments may be' 
employed, as was done in the present case, that contain a single Xbal 
site between EcoRI and BamHI cleavage residues. Thus, an EcoRI and 
BamHI digestion product of the viral genome of hepatitis B was 
obtained by conventional means and cloned into the EcoRI and BamHI 
20 sites of plasmid p6H6 (O.V. Goeddel et ai-. Nature 281. 544 [1979])) 
to form the plasmid pHS32. Plasmid pHS32 was cleaved with Xbal, 
phenol.extracted, chloroform extracted and ethanol precipitated. It 
was then-.treated with 1 yl £. coli polymerase I, Klenow fragment 
(Boehrjnger-Mannheim) in 30 ul polymerase buffer (50 mM potassium 
25 phosphate pH 7.4, 7rrM MgClg. 1 mM fl-mercaptoethanol ) containing 
O.lmM dTTP and O.lmM dCTP for 30 ninutes at O'C then 2 hr. at 37"C. 
This treatment causes 2 of the 4 nucleotides complementary to the 5' 
protruding end of the Xbal cleavage site to be filled in: 

5* CTA6A S* CTA6A 

30 . 3' T— ^ y TCT— 

Two nucleotides, dC and dT, were incorporated giving an end with two 
5' protruding nucleotides. This linear residue of plasmid pHS32 
(after phenol and chloroform extraction and recovery in water after 
ethanol precipitation) was cleaved with EcoRI. The large plasmid . 
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fragment 21 was separated from the smaller EcoRI-Xbal fragment by 
PAGE and isolated after electroelution. This DNA fragment from pHS32 
(0.2 tig)> was ligated, under conditions similar to those described 
above, to the EcoRI-Taq I fragment of the tryptophan operon (-0.01 
5 yg), as shown in Figure 5. In this process the Taej I protruding end 
• is ligated to the Xbal remaining -protruding end even though it is not 
completely Watson-Crick base-paired: 

T + CTAGA TCTAGA— 

AGC TCT " ^ — AGCTCT 

10 A portion of this ligation reaction mixture was transformed into E. 
coli 294 cells as in part I. above, heat treated and plated on LB 
plates containing ampicillin. Twenty-four colonies were selected, 
grown in 3 ml LB media, and plasmid isolated. Six of these were 
found to have the Xbal site regenerated via E. coli catalyzed DNA 

15 repair and replicationr 

TCTAGA TCTAGA 

AGCTCT ""^ ^ AGATCT 

These plasmids were also found to cleave both with EcoRI and Hpal and 
to give the expected restriction fragments. One plasmid 14, desig- 
20 nated pTrp 14, vas used for expression of heterologous polypeptides, 
as next discussed. 

The plasmid pHGH 107 (18 in Figure 6, D.V. Goeddel et al. Nature . 
281 . 544, 1979) contains a gene for human growth hormone made up of 
23 amino acid codons produced from synthetic DNA fragments and 163 

25 amino acid codons obtained from complementary DNA produced via 
reverse transcription of human growth hormone messenger RNA. This 
gene 21, though it lacks the codons of the *pre* sequence of human 
growth hormone, does contain an ATG translation initiation codon. 
The gene was isolated from 10 ug pHGH 107 after treatment with EcoRI 

30 followed by E. coli polymerase I Klenow fragment and dTTP and dATP as 
described above. Following phenol and chloroform extraction and 
ethanol precipitation the plasmid was treated with BamHI. See Figure 
6. 
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The human growth hormone ("HGH") gene-containing fra^nent 21 was 
Isolated by PAGE followed by electroelutlon. The resulting DNA 
fragment also contains the first 350 nucleotides of the tetracycline 
resistance structural gene, but lacks the tetracyllne 

5 promoter-operator system so that, when subsequently cloned Into an 
expression plasmid, plasnlds containing the Insert can be located by 
the restoration of tetracycline resistance. Because the EcoRI end of 
the fragment 21 has been filled in by the Klehow polymerase I- 
procedure, the fragment has one blunt and one sticky end, ensuring 

10 proper orientation when later Inserted Into an expression plasmid. 
See Figure 6. 

• 

The expression plasmid pTrpl4 was next prepared to receive the 
HGH gene-containing fragment prepared above. Thus, pTrplA was Xbal 
digested and the resulting sticky ends filled In with the Klenow 

15 polymerase I procedure employing dATP, dTTP, dGTP and dCTP. After 
phenol and chloroform extraction and ethanol precipitation the 
resulting DNA 16 was treated with BamHI and the resulting large 
plasmid fragment 17 Isolated by PAGE and electroelutlon. The 
pTrpl4-derived fragment 17 had one blunt and one sticky end, 

20 permitting recombination In proper orientation with the HGH gene 
containing fragment 21^ previously described. 

• . * 

The HGH gene fragment 21 and the pTrpl4 aXba-BamHI fragment 17 
were combined and llgated together under conditions similar to those 
described above. The filled In Xbal and EcoRI ends llgated together 
25 by blunt end ligation to recreate both the Xbal and the EcoRI site: 



Xbal filled In EcoRI filled In HGH gene Initiation 

— TCTAG ^ AATTCTAT6 T^TAj^AITCTATG 

AGATC TTAAGATAC ^ AGATCjTTAAbATAC 

Xbal EcoRI 
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This construction also recreates the tetracycline resistance gene. 
Since the plasaiid pHGH 107 expresses tetracycline resistance from a 
promoter lying upstream from the HGH gene (the lac promoter), this 

• construction 22. designated pHGH 207, permits expression of the gene 
5 for tetracycline resistance under the control of the tryptophan 

• promoter-operator. Thus the ligation mixture was transfonned Into E. 
coli 294 and colonies selected on LB plates containing S wg/nl 
•tetracycline. * • 

In order to confirm the direct expression of human growth 

10 hormone from plasmid pHGH 207, total cellular protein derived from 
E.coli 294/pHGH 207 that had been grown to optical density 1 in LB 
media containing 10 ug/ral ampicillin and diluted 1 to .10 into.M9 
media, and grown again to optical density 1, was subjected to SOS gel 
electrophoresis as in the case of part I. above and compared to 

15 similar electrophoresis data obtained for human growth hormone as 
previously expressed by others (O.V. Goeddel et al, Nature . 281. 54A 
(1979)). Figure 7 is a photograph of the resulting, stained gel 
wherein: Lanes 1 and 7 contain protein markers of various known 
sizes; Lane 2 Is a control that separates total cellular protein of 

20 E. Coli strain 294 pBR322; Lane 3 segregates protein from E. Coll 
294/pHGH 107 grown in LB media; Lane 4 segregates protein from E. 
Coli 294/pHGH 107 grown in M9 media; Lane 5 segregates protein from 
E.. Coli 294/pH6H 207 grown in LB media; and Lane 6 segregates protein 
from E. Coli 294/pH6H 207 grown in M9. The dense band in Lane 6 is 

25 human growth hormone, as shown by comparison to the similar bands in 
Lanes 2-4. As predicted by the invention, the organism E. Coli 
294/pHGH 207 when grown in tryptophan-rich LB media produces less 
human growth hormone by reason of tryptophan repressor/operator 
interactions, and when grown in M9 media produces considerably more 

30 HGH than E. Coli 294/pHGH 107 owing to the induction of the stronger 
tryptophan promoter-operator system vs the lac promoter-operator 
system in pHGH 107. 

III. Creation of a general expression plasmid for the direct 
expression of heterologous genes under control of the tryptophan 
35 promoter-operator. 
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The plasmid pHGH 207 created In the preceding section was next 
used to obtain a DNA fragment containing the control elements of the 
tryptophan operon (with the attenuator deleted) and to create a 
plasmid ''expression vector" suitable for the direct expression of 

5 . various structural gene Inserts. The strategy for creation of the 
general expression plasmid Involved ranoval of the tryptophan control 
region -from pHGH 207 by EcoRI digestion and Insertion In the 
EcoRI-dlgested plasmid pBRHl used In part I. supra. pBRHl, as 
previously noted. Is an amplclllln resistant plasmid containing the 

10 tetracycline resistance gene but is tetracycline sensitive because of 
the absence of a suitable promoter-operator system. The resulting 
plasmid, pHKY 1, whose construction Is more particularly described 
below and shown In Figure 8,- Is both amplclllln and .tetracycline 
resistant, contains the tryptophan promoter-operator system, lacks 

15 the tryptophan attenuator, and contains a unique Xbal site distal 
from the tryptophan promoter-operator. The tryptophan promoter- 
operator and unique Xbal site are bounded by EcoRI sites, such that 
the promoter-operator-Xbal-contalnIng fragment can be removed for 
insertion In other structural gene-containing plasmlds. 

20 Alternatively, heterologous structural genes may be Inserted, either 
Into the Xbal site or (after partial EcoRI digestion) Into the EcoRI 
site distal from the tryptophan control region. In either case so as 
to come under the control of the tryptophan promoter-operator system. 

« 

Plasmid pHGH 207 was EcoRI digested and the trp promoter 
25 containing EcoRI fragment 23 recovered by PAGE followed by 
electroelutlon. 

Plasmid pBRHl was EcoRI digested and the cleaved ends treated 
with bacterial alkaline phosphatase ("BAP") (1 ug. In 50 oM tris pH 8 
«and 10 mM MgCl^ for 30 min. at 65'C) to rmove the phosphate groups 
30 on the protruding EcoRI ends. Excess bacterial alkaline phosphatase 
was removed by phenol extraction, chloroform extraction and ethanol 
precipitation. The resulting linear DNA 7a, because It lacks 
phosphates on the protruding ends thereof, will In ligation accept 
only Inserts whose complementary sticky ends are phosphorylated but 
35 will not Itself reclrcul arize, permitting more facile screening for 
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plasmlds containing the Inserts. The EcoRI fragment derived from 
pHGH 207 and the linear ONA obtained fron pBRHl were combined In the 
presence of llgase as previously described and llgated. A 
portion of the resulting mixture was transformed Into E. coll strain 
294 as previously described, plated on LB nedia containing 5 wg/ml of 
tetracycline, and 12 tetracycline resistant colonies selected. 
Plasmid was isolated from each colony and examined for the presence 
of a DNA Insert by restriction endonuclease analysis employlng^ EcoRI 
.and Xbal. One plasmid containing the Insert was designated pHKYl. 

IV. Creation of a plasmid containing the tryptophan operon capable 
of expressing a specifically cleavable fusion protein comprising 6 
amino acids of the trp' leader peptide and the last third of the trp E 
polypeptide (designated LE') and a heterologous structural gene 
product. 

The strategy for the creation of a LE' fusion protein expression 
plasmid entailed the following steps: 

a. Provision of a gene fragment comprising codons for the 
distal region of the LE'po.lypeptide'having Bgl II and EcoRI 
sticky ends respectively at the 5' and at the 3' ends of the 
coding strand; 

• 

b. Elimination of the codons from the distal region of the LE* 
gene fragment and those for the trp D gene from plasmid SOM 7 a2 
and insertion of the fragment formed In step 1, reconstituting 
the LE* codon sequence Immediately upstream from 

that for the heterologous gene for somatostatin. 

1. With reference to Figure 9(a), plasmid pSom7 tZ was Hind* III 
digested followed by digestion with lambda exonuclease (a 5' to 
3'exonuclease) under conditions chosen so as to digest beyond the Bgl 
II restriction site within the LE' encoding region. 20 wg of Hind 
Ill-digested pSom 7 a2 was dissolved In buffer [20mM glycine buffer, 
pH 9.6. ImM MgClp. ImM 6-mercaptoethanol]. The resulting mixture 
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was treated with 5 units of lambda exonuclease for 60 minutes at room 
temperature. The reaction mixture obtained was then phenol 
extracted, chloroform extracted and ethanol precipitated. 

In order ultimately to create an EcoRI residue at the distal end 
5- of the LE' gene fragment a primer ^^pCCTCTGCATGAT was synthesized 
by the Improved phosphotrtester method (R. Crea et al., Proc HatM 
Acad Scl USA 75, 5765 [1978]) and hybridized to the single stranded 
end of the LE' gene fragment resulting from lambda exonuclease 
digestion. The hybridization was performed as next described! 

10 ZOug of the lambda exoi^uc lease-treated Hind MI digestion 

product of plasmid pSom7 42 was dissolved in 20ul H^O and combined 
with 6wl of a solution containing approximately 80 picomoles of the 
5'-phosphorylated oligonucleotide described above. The synthetic 
fragment was hybridized to the 3' end of the LE' coding sequence and 

15 the remaining single strand portion of the LE' fragment was filled In 
by the Klenow polymerase I procedure described above, using dATP. 
dTTf , dGTP and dCTP. * . 

The reaction mixture was heated to 50*C and let cool slowly to 
lO'C, whereafter 4ul of Klenov enzyme were added. After IS minute 

20 room temperature incubation, followed by 30 minutes incubation at 
37'C, the reaction was stopped by the addition of 5pl of 0.25 molar 
EDTA. The reaction mixture was phenol extracted, chloroform 
extracted and ethanol precipitated. The ONA was subsequently cleaved 
with the restriction enzyme Bgl II. The fragments were separated by 

25 PAGE. An autoradiogram obtained from the gel revealed a 

^^P-labelled fragment of the expected length of approximately 470 
bp, which was recovered by electroelution. As outlined, this 
fragment LE'(d) has a Bgl II and a blunt end coinciding with the 
beginning of the primer. 

30 The plasmid pThol described in part I (C.) above carries a 

structural gene for thymosin alpha one cloned at its 5' coding strand 
end into an EcoRI site and at its 3* end into a BamHI site. As shown 
in Figure 9, the thymosin gene contains a Bgl II site as well. 
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Plasmid pThol also contains a gene specifying ampicillin resistance. 
In order to create a plasmid capable of accepting the L£'(d) fragment 
prepared above. pTh.l was EcoRI digested followed by Klenow 
polymerase I reaction with dHP and dATP to blunt the EcoRI 
residues. Bgl II digestion of the resulting product created a linear 
DMA fragment 33 containing the gene for anpictllin resistance and, at 
its opposite ends, a sticky Bgl II residue and a blunt end. The 
resulting product could be recircularized by reaction with the IV (dy 
fragment containing a' Bgl II sticky end and a blunt end in th^ ' 
presence of Ugase to form the plasmid pTrp24 (Fig. 9b). In 
doing so. an EcoRI site is recreated at the position where blunt end 
. ligation occurred. 

With reference to Figure 10, successive digestion ofpTrp24 with 
Bgl II and EcoRI, followed by PAGE and electroelution yields a 
15 fragment having codons for the LE'(d) polypeptide with a Bgl II 
sticky end and an EcoRI sticky end adjacent its 3'. coding terminus. 
The LE'(d) fragment 38 can be cloned into the Bgl II site of plasmid 
pSom7 a2 to form an LE' polypeptide/somatostatin fusion protein 
expressed under the control of the tryptophan promoter-operator, as 
shown in Figure 10. To do so requires (1) partial EcoRI digestion" of 
pSom? a2 in order to cleave the EcoRI site distal to the tryptophan 
promoter-operator, as shown in Figure 10 and (2) proper choice of the 
primer sequence (Figure 9) in order to. properly maintain the codon 
reading frame, and to recreate an EcoRI cleavage site. 

25 Thus. 16 wg plasmid pSom? a2 was diluted into 200 m1 of buffer 

containing 20 mM Tris. pH 7.5. 5 mN MgCl^. 0.02 NP40 detergent. 
100 enM NaCl and treated with 0.5 units EcoRI. After 15 minutes at 
37'C. the reaction mixture was phenol extracted, chloroform extracted 
and ethanol precipitated and subsequently digested with Bgl II. The 

30 larger resulting fragment 36 isolated by the PAGE procedure followed 
by electroelution. This fragment contains the codons "LE'(p)'' for 
the proximal end of the LE' polypeptide, ie, those upstream from the 
Bgl II site. The fragment 36 was next ligated to the fragment 38 in 
the presence of T^ ONA ligase to form the plasmid pSom7 a2A4. which 

35 upon transformation into E. coli strain 294, as previously described. 
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efficiently produced a fusion protein consisting. of the fully 
reconstituted LE* polypeptide and somatostatin under the control of 
the tryptophan promoter-operator. The fusion protein, from which the 
• somatostatin may be specifically cleaved owing to the i>resence of a 
S methionine at the 5' end of the somatostatin sequence was segregated 
by ^.OS polyacryl amide gel electrophoresis as previously described. 
The fusion protein product Is the most distinct band^parent In Lane 
6 of Figure 11, discussed In greater detail. In Part VI Infra; •• 

v. Creation of an expression system for trp LE' polypeptide fusions 
10 wherein tetracycline resistance Is placed under the control of the 
tryptophan promoter-operator. 

. . - . • • . . 

The strategy for creation of an expression vehicle capable of 
receiving a wide variety of heterologous polypeptide genes for 
expression as trp LE' fusion proteins under the control of the 
15 tryptophan operon entailed construction of a plasmid having the 
follov/ing characteristics: . • • •.• 

1. Tetracycline resistance which would be lost in the event of 
the promoter-operator system controlling the genes specifying 
such resistance was excised. 

• _ • 

20 2. Removing the promoter-operator system that controls 

tetracycline resistance, and recircularizing by ligation to a 
heterologous gene and a tryptophan promoter-operator system In 
proper reading phase with reference thereto, thus restoring 
tetracycline resistance and accordingly permitting 

25 Identification of plasmlds containing the heterologous gene 

Insert. 

In short, and consistent with the nature of the Intended Inserts, the 
object was to create a linear piece of DNA having a Pst residue at 
its 3' end and a Bgl II residue at Its 5' end, bounding a gene 
30 capable of specifying tetracycline resistance when brought under the 
control of a promoter-operator system. 
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Thus, with reference to figure 12, pUsnId pBR322 was Hind III 
digested and the protruding Hind III ends in turn digested with SI 
nuclease. The SI nuclease digestion involved treatment of 10 ug of 
Hind Ill-cleaved pBR322 in 30 yl SI buffer (0.3 M NaCl. 1 oM ZnCl . 

S 25 DIM sodium acetate, pH 4.5) with 300 units SI nuclease for 30 ^' 
minutes at IS'C. The reaction was stopped by the additon of 1 yl of 
30 X SI nuclease stop solution (0.8M tris base,-!50 vH EOTA). The 
mixture was phenol extracted, chloroform extracted and ethanol 
precipitated, then EcoRI digested as previously described and' the 

10 large fragment 46 obtained by PAGE procedure followed by 

electroelutlon. The fragment obtained has a first EcoRI sticky end 
and a second, blunt end whose coding strand. begins with the 
nucleotide thymidine. As will be subsequently shown, the Sl-digested 
Hind III residue beginning with thymidine can be Joined to a Klenow 

15 polymerase I-treated Bgl II residue so as to reconstitute the Bgl II 
restriction site upon ligation. 

Plasmid pSorn? a2, as prepared in Part I above, was Bgl II 
digested and the Bgl II sticky ends resulting made double stranded 
with the Xlenow polymerase I procedure using all four deoxynucleotide 

20 triphosphates. EcoRI cleavage of the resulting product followed by 
PAGE and electroelutlon of the small fragment 42 yielded a linear 
piece of ONA containing the tryptophan promoter-operator and codons 
of the UE' "proximal" sequence upstream from the Bgl II site 
(••LE»(p)"). The product had an EcoRI end and a blunt end resulting 

25 from filling in the Bgl II site. However, the Bgl II site Is 
reconstituted by ligation of the blunt end of fragment 42 to the 
blunt end of fragment 46. Thus, the two fragments were ligated in 
the presence of T^ OKA ligase to form the reclrcularized plasmid 
pHKY 10 (see Figure 12) which was propagated by transformation into 

30 competent E. coH strain 294 cells. Tetracycline resistant cells 
bearing the recombinant plasmid pKKY 10 were grown up, plasmid DNA 
extracted and digested in turn with Bgl II and Pst followed by 
Isolation by the PAGE procedure and electroelutlon of the large 
fragment, a linear piece of DNA having Pst and Bgl II sticky ends. 

35 This DNA fragment 49 contains the origin of replication and 
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subsequently proved useful as a first component in the construction 
of plasmids where both the genes coding for trp LE* polypeptide 
fusion proteins and the tet resistance gene are controlled by the trp 
promoter /operator. * ... 

5. . Plasmid pSoin? a2a4, as previously prepared In Part IV, could be 
manipulated to provide a second cooponent for a system capable of. 
receiving a wide variety of heterologous structural genes. With 
reference to Figure 13, the plasmid was subjected to partial EcoRI 
digestion" (see Part IV) followed by Pst digestion and fragment 51 

10 containing the trp promoter/operator was isolated by the PAGE 

procedure followed by electroelution. Partial EcoRI digestion was 
necessary to obtain a fragment which was cleaved adjacent to the 5' 
end of the somatostatin gene but not cleaved at the EcoRI site 
present between the ampicillin resistance gene and the trp promoter 

15 operator. Ampicillin resistance lost by the Pst I cut in the ap*^ 
gene could be restored upon ligation with fragment 51. 

In a first demonstration the third component, a structural gene 
for thymosin alpha-one was obtained by EcoRI and BamHI digestion of 
plasmid pThol. The fragment, 52, was purified by PAGE and 
20 electroelution. 

The three gene fragments 49, 51 and 52 could now be ligated 
together in proper orientation, as depicted in figure 13. to form the 
plasmid pTho7AlA4, which could be selected by reason of the 
restoration of ampicillin and tetracycline resistance. The plasmid, 

25 when transformed into E. coll strain 294 and grown up under • 
conditions like those described In Part I, expressed a trp LE* 
polypeptide fusion protein from which thymosin alpha one could be 
specifically cleaved by cyanogen bromide treatment. When other 
heterologous structural genes having EcoRI and BamHI termini were 

30 similarly ligated with the pHKYlO-derived and pS0M7 A2A4-derived 
components, trp LE' polypeptide fusion proteins containing the 
polypeptides for which those heterologous genes code were likewise 
efficiently obtained. Figure 11 illustrates an SOS polyacryl amide 
gel electrophoresis separation of total cellular protein from E. coli 
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strain 294 transformants. the darkest band in each case representing 
the fusion protein product produced under control of the tryptophan 
promoter-operator system. In Figure 11. Lane 1 is a control which 
segregates total cellular protein from E. coH 294/pBR322. Lane 2 
5 contains the somatostatin fusion product from plasmid pSooi? A2a4 
prepared in Part IV. Lane 3 is the somatostatin-containing 
expression product of 9$om7 aU4. Lane 4 contains the expression 
product of pTha7Ala4. whereas Lane 5 contains the product expressed 
from a plasmid obtained when the pHKY-lO-derived and pSom7 * 
10 A2A4-derived fragments discussed, above were ligated with an 

EcoRI/BamHI terminated structural gene encoding human proinsulin and 
prepared in part by certain of us. Lanes 6 and 7 respectively 
contain, as the darkest band, a trp LE' polypeptide fusion protein 
from which can be cleaved the B and A chain of human insulin. The 
insulin B and A structural genes were obtained by EcoRI and BamHI 
digestion of plasmids pIBl and pIAU respectively, whose construction 
is disclosed in D.V. Goeddel et al.. Proc Kat'l Acad sn i.^a 7^, loe 
[1979]. Lane 8 contains size markers, as before. 



15 



* * * 



While the invention in its most preferred embodiment is 
20 described with reference to E . coll. other enterobacteriaceae could 
likewise serve as host cells for expression and as sources for trp 
operons, among which may be mentioned as examples Salmonella 
tyDhimurium and Serratia aarcesans. Thus, the invention is not to be 
limited to the preferred embodiments described, but only by the 
25 lavrful scope of the appended claims. 
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CLAIMS ; 

1. A method of cr a ting an xpression plasnid for the 
expression of a heterologous gene which comprises the 
simultaneous ligation, in phase, of: 

(a) a first linear double-stranded DNA fragment 
containing a replicon and a gene which expresses a 
"Belectable characteristic when placed under the 
direction of a bacterial promoter, said fragment 
lacking any such promoter; 

(b) a second linear double-stranded DNA fragment 
comprising said heterologous gene; an.d 

(c) a third double-stranded DNA fragment which comprises 
a bacterial promoter; 

the ligatable ends of said fragments being configured such 
that upon ligation to form a replicable plasmid both the gene 
for the selectable characteristic and the heterologous gene 
come under the direction of the promoter, thus permitting use 
of the selectable characteristic in selection of transformant 
bacteria colonies capable of expressing the heterologous gene. 

2. The method of claim 1 wherein the selectable 
characteristic is antibiotic resistance. 

3. The method of claim 2 wherein the selectable 
characteristic is tetracycline resistance and wherein the 
bacterial promoter is the trp promoter. 

^ • • 

4. The method of claim 3 wherein ligation reconstitutes an 
operon for the expression of ampicillin resistance as well. 

5. A method of cleaving double stranded DNA at any given 
point which comprises: 

(a) converting the double stranded DNA to singl - 

stranded DNA in a region surrounding' said point; 
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(b) 



(c) 



hybridizing to the single-stranded region £ tmed in 
step (a) a complementary primer length of single- 
stranded DNA, the 5« nd of the primer lying . 
opposite the nucleotide adjoining the intended 
cleavage site; 

restoring that portion of the second strand • 

. eliminated in step (a) which lies in the 3' directicn 

from said primer by reaction with DNA polymerase in 

the presence of adenine, thymine, guanine and 

cytosine-containing deoxynucleotide triphosphates; 
and 

digesting the remaining single-stranded length of 
DNA which prottudes beyond the intended cleavage 
point. 



6. The method of claim 5 wherein steps (c) and (d) are 
performed simultaneously by reaction with DNA polymerase vihich 
polymerizes in the directicn of 5* •» 3', is exonucleolytic in the 
direction of 3' 5', but npn-exonucleolytic in the directi6n of 5' ^3'. 

7. The method of claim 6 wherein the polymerase is Klenow 
Polymerase I. 

• • ■ 

8. A plasmidic expression vehicle for the production in 

E. £oli bacteria of a heterologous polypeptide product, said 
vehicle having a sequence of double-stranded DNA comprising, 
in phase from a first 5' to a second 3' end of the coding 
strand thereof, the elements: 

(i) a bacterial trp promoter-operator system; 
(ii) nucleotides coding for a ribosome binding site for 
translation of element (iv) ; 
(iii) nucleotides coding for a translation start signal 

for translation f elem nt (iv) ; and 
(iv) a structural gene encoding the amino acid sequence 
of a heterologous polyp ptid ; 
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said sequence. comprising neither any trp att nuation 
capability nor nucleotides coding for the trp £ ribos me 
binding site. 

9/ The method of producing a polypeptide product by the 
'expression in bacteria of a structural gene coding therefor 
' which comprises: * 

(a) providing a bacterial inoculant transformed with a 
replicable plasmidic expression vehicle haviijg a 
sequence of double-stranded DNA comprising, in 
phase from a first 5* to a second 3» end of the 
coding strand thereof, the elements: 

(i) a bacterial trp promoter-operator system; 
(ii) nucleotides coding for a ribosome binding 
site for translation of element (iv); 
(iii) nucleotides coding for a translation start 
signal for translation of element (iv) ; and 
(iv) a structural . gene encoding the amino acid 
sequence 'of a heterologous polypeptide; 
said sequence comprising neither any trp attenuation 
capability nor nucleotides coding for the trp E 
ribosome binding site; 

(b) placing. the transformed inoculant in a fermentation 
vessel and growing the same to a predetermined 1 vel 
in suitable nutrient media containing additive 
tryptophan sufficient in quantity to repress said 
promoter-operator system; and 

(c) depriving said bacteria of said additive so as to 
derepress said system and occasion the expression of 
the product for which said structural gene codes. 

10. The vehicle of claim 8 or method of claim 9 wherein th 
polyp ptide expressed by said structural gene is ntirely 
heterologous. 
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11. The vehicle of claim 8 or the method of claim 9 wh t in 
th polyp ptide xpressed is a fusion protein comprising a 
heterologous polypeptide and at least a portion of the amino 
acid sequence of a homologous polypeptide*. 

12. The vehicle or method of claim 11- wherein said portion is 
a portion of the amino acid sequence of an enzyme involved in 
the biosynthetic pathway from chorismic acid to tryptophan. 

13. The vehicle or method of claim 12 wherein the het rologous 
polypeptide is a bioactive polypeptide and the fused homologous 
polypeptide is a specifically cleavable bioinactivating 
polypeptide. 

14. The vehicle or method of claim 11 wherein the homologous 
polypeptide is the trp E polypeptide and wherein said ribosome 
binding site is the ribosome binding site for the trp leader 
polypeptide. 

15. The vehicle or method of claim 11 wherein the homologous 
polypeptide is the trp D polypeptide. 

16. The vehicle or method of claim 14 wherein the fusion 
protein comprises an heterologous polypeptide and a homologous 
polypeptide which itself constitutes a fusion of about the 
first six amino acids of the trp leader polypeptide and the 
amino acid sequence encoded by at least about the distal 
third of the trp E polypeptide gene. 

17. The vehicle or claim 8 or method of claim 9 wherein the 
heterologous polypeptide comprises a recoverable polypeptide 
selected from the group consisting of human growth hormone, 
human proinsulin, somatostatin, thymosin alpha 1, the A chain 
of human insulin and the B chain of human insulin. 
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18. Th n thod of claim 8 wherein tryptophan deprivation is 
effected by cessation f additi n of said additive and by 
dilution of the fenn ntation media in which said inoculant is 
first grown up. 

19. The method of claim 18 wherein the host bacteria is 
E*. coli. 

20. The plasmids pBRHtrp, pSOM7A2, pHGH207, pHKYl, pSOM7A2A4, 
pThyo7AlA4, and pTha7A2. 



0036776 




0036776 



SOM 




' 0036776 

3A4 




4ll4^ 0036776 




0036776 




0036776 




pTrp 14 



I Klenow Pol I 
|+4dNTP's 

16 

« 

1^^/77 HI. PAGE 

filled in 





\fcoftl 
12 

iKIenowPolI 

tdATP.dHP 

20 

BamHl, PAGE 



filled in 



HGHgene 



21 



^+T4DNA Ifgase 




FIG.6 



0036776 




FIG.8 



0036776 



Tfigmosin^ene 




31 



KlenowPoII 

4dTTRdATP 7 22 



ScoRl 



yglNN filled in 
// .So/H 




+T4 DNA lipase 




M pTrp24 

FiG.9b 



0036776 




D-SOM fusion 



PSOM7A2 



\£coRl 
I partial digest 

35 



PAGE 




fco RI 
SOM 




fwRI.PAGE 



l£^(d) 
38 



I -1-74 DNA iigase 
L£-SOM fusion 




FIG.IO 



PSOM7A2A4 



» 0036776 

12/14 



8 



- • • • 



« * . 



* . • • 

■ : ' . • ■••i • 
. .-J 

........ •'J-V- 

1 



1 



Ml 
CD 




LE-SOM fusion 



39 pS0M7AEA4. 

1 Partial Digest 
50 



Pst, PAGE 



t 



thymosin ctjQens 
52 



Bam 



+T4 DNA lipase 
i.Q,=-^E -thymosin fusion 




pThyayAlM 



FIG. 12 




5 



